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ABSTRACT 


Deductive  planning  techniques  are  described  for  rule  selection  In  question- 
answering systems.  Rules  typically  represent  Inferential  knowledge 
applicable  to  a given  domain  of  discourse.  Given  a question-answering 
system  containing  a large  number  of  such  rules,  a crucial  problem  arises 
in  selecting  those  rules  that  are  relevant  and  needed  to  answer  particular 
Input  queries.  A planning  process  has  been  implemented  to  find  chains 
of  rules  that  deductively  Infer  the  desired  conclusions.  The  processes  of 
forward  chaining  from  assumptions  and  backward  chaining  from  goals  are 
combined  to  form  middle- term  chains  which  provide  the  basis  of  the  planning 
mechanism.  The  applicability  and  generality  of  these  techniques  for  use 
In  other  rule-based  systems  is  also  discussed. 
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P.  Klahr 

1.  Introduction 

An  Increasingly  Important  problem  In  rule-based  Inference  systems  Is  one  of 
selecting  rules  that  are  relevant  to  answer  a user's  Input  query  or  to  solve  a 
particular  problem.  This  rule-selection  problem  becomes  crucial  In  systems 
containing  a large  number  of  sucn  rules.  Given  a query  that  needs  deductive 
support,  1.e.»  a query  that  cannot  be  answered  by  direct  retrieval  from  the  file 
of  specific  facts,  the  system  must  find  Us  way  among  the  potentially  large  set 
of  general  statements  (rule-based  knowledge)  and  discover  a subset  that  Is  rele- 
vant and  sufficient  to  answer  the  given  query.  Without  significant  guidance  and 
direction,  the  system  may  get  hopelessly  lost  in  generating  Irrelevant  Inferences. 
New  planning  techniques  have  been  designed  and  Implemented  for  selecting  rele- 
vant rules.  Although  the  techniques  have  been  developed,  and  will  be  discussed, 
within  the  framework  of  a quest  Ion -answering  system  [11,12],  they  should  be 
applicable  to  other  systems  that  use  rules  as  the  basis  for  Inferring  and  deduc- 
ing new  or  Implicit  knowledge. 

Early  work  In  question  answering  used  special-purpose  deductive  mechanisms. 

The  most  common  was  the  set-inclusion  logic  of  SIR  [19],  CONVERSE  [10],  and 
SYNTHEX  [24].  With  the  development  of  the  "resolution  principle"  In  mechanised 
theorem- proving  [21],  several  researchers  have  Incorporated  resolution  techniques 
Into  question-answering  systems  for  purposes  of  more  general-purpose  (domain- 
independent)  deduction.  Most  notable  among  these  was  the  system  developed  by 
Green  [8].  But  problems  with  explosive  search  space,  even  for  small  numbers  of 
axioms  and  theorems,  and  with  canonical  forms  that  are  difficult  for  users  to 
comprehend,  have  led  many  researchers  to  find  alternative  representations  and 
deductive  techniques.  Even  with  the  various  resolution  strategies  that  have  been 
developed  (Chang  and  Lee  [5]  discuss  most  of  these),  very  few  current  systems  use 
resolution  as  a basis  for  question  answering  (the  Maryland  system  [44]  being  a 
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As  an  alternative  to  resolution,  the  use  of  more  "natural"  deduction  techniques 
have  become  common  In  question-answering,  problem-solving,  and  even  theorem- proving 
systems.  Such  techniques  usually  Involve  the  use  of  goal-oriented  backward  chain- 
ing or  sub-goal  generation  as  a basis  for  deduction.  Given  an  Input  query  or  goal, 
rules  whose  consequents  match  the  goal  are  Invoked,  and  the  antecedents  of  the  rules 
are  set  up  as  subgoals.  The  process  repeats  recursively.  Such  backward  chaining 
techniques  are  found  In  several  rule-based  systems,  e.g..  In  production  systems 
such  as  MYCIN  [6]  and  RITA  [1];  In  PLANNER- like  language  systems  [9,4]  where 
rules  are  In  the  form  of  procedures;  and  In  several  theorem-proving  systems 
[2,15,20].  Rule  selection  In  these  systems  ranges  from  an  emphasis  on  user 
Interaction  In  choosing  rules  [3],  to  having  an  ordered  recommendation  list 
of  rules  to  apply  [9],  to  picking  up  all  applicable  rules  [6]. 

The  process  of  backward  chaining  proceeds  in  one  direction,  backward  from  a 
goal.  In  many  cases,  backward  chaining  will  lead  to  dead  ends,  l.e.,  subgoals  which 
cannot  be  deduced,  found  In  the  data  base,  or  supplied  by  the  user.  As  backward 
chaining  proceeds,  rules  are  applied  if  their  consequents  match  the  desired  goal. 
Substitutions  for  variables  In  the  rules  occur  during  the  matching  process  and  are 
passed  on  to  variables  In  rules  selected  for  further  backward  chaining.  This  process 
thus  verifies  the  consistency  of  the  variable  substitutions  In  the  course  of  selec- 
ting and  applying  rules.  But  the  effort  expended  in  this  verification  Is  wasted  for 
those  cases  where  chaining  leads  to  dead  ends.  The  planning  mechanism  discussed 
below  separates  the  selection  process  from  the  verification  process.  Verification 
(consistency  of  substitutions)  Is  delayed  until  a set  of  rules  has  been  selected 
as  being  relevant  to  the  particular  problem,  thus  avoiding  the  verification  of  many 
fruitless  deductive  paths. 


I 


The  approach  to  rule  selection  taken  here  Is  one  of  generating  derivation  plans, 
or  proof  skeletons,  representing  possible  deductions  needed  to  answer  queries.  The 
planning  process  combines  backward  chaining  from  goals  with  forward  chaining  from 
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Assumptions  to  form  deductive  chains  through  the  rules.  The  generation  of  these 
resultant  "middle- term"  chains  Is  the  basic  mechanism  used  for  rule  selection. 

In  generating  middle-term  chains,  the  system  Is  aware  of  the  deductive  Interactions 
among  the  rules  but  It  Is  not  concerned  with  the  verification  of  variable  substitu- 
tions within  chains  or  proof  plans.  Such  verification  occurs  after  proof  planning. 
This  paper  focuses  on  defining  middle-term  chaining  and  showing  how  It  Is  used  In 
proof  planning. 

2.  Organization  and  Representation  of  Knowledge 

Within  our  Knowledge  base,  we  will  distinguish  between  facts  and  rules.  Facts  are 
represented  In  a relational  format,  l.e.,  a relation  (predicate)  followed  by  Its 
arguments.  Examples  of  facts  are  GKEEK( Socrates) , ATTEND(Sm1th,ACM76) , CONOUCTS- 
RESEARCN-AT( Jones, MIT).  Rules  are  general  statements  which  the  system  uses  to 

deduce  new  or  Implicit  Knowledge.  They  are  In  the  form  of  Implications.  The  left- 
hand  side  of  a rule  represents  the  set  of  assumptions  or  conditions  of  the  rule, 
while  the  right-hand  side  represents  the  set  of  goals  or  actions  of  the  rule.  Con- 
junctions, disjunctions,  and  negations  can  occur  on  either  side  of  the  implication. 

An  example  rule  is  A(ATTENO(x,z) , ATTEND(y.z))  D SCIENTIFIC-C0NTAC7(x,y) , which  states 
that  If  two  scientists  (x  and  y)  attend  the  same  conference  (z)  then  scientific 
contact  can  occur  between  them.  All  variables  occurring  in  rules  In  this  paper 
will  be  considered  universally  quantified  (with  optional  domain  class  restrictions, 
e.g.,  scientist,  conference,  etc.).  For  the  use  of  rules  containing  existentially 
quantified  variables,  see  Klahr  [12], 

The  primary  motivation  for  distinguishing  rules  and  facts  Is  that  the  deductive 
processor  operates  on  the  rules,  while  a data  management  system  accesses  and 
retrieves  facts.  This  organization  is  in  sharp  contrast  to  most  resolution-based 
systems,  which  combine  rules  (axioms  and  theorems)  and  facts  Into  a single  flic. 

The  distinction  between  rules  and  facts  being  made  here  Is  similar  to  the  separation 
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of  rules  and  data  base  found  in  typical  production  systems  [7],  However, 
the  control  structure  and  the  interface  between  rules  and  data  base  is  more 
constrained  in  our  system.  We  are  concerned  with  domains  containing  large  nunbers 
of  rules  and  facts.  Our  primary  focus  is  on  developing  techniques  for  locating 
relevant  rules  within  our  deductive  processor.  The  problem  of  searching  over  a 
large  file  of  facts  is  also  a significant  problem,  but  it  is  not  one  we  are  address- 
ing here.  The  data  management  system  we  have  written  is  satisfactory  for  use  in  our 
initial  experimentation,  but  we  are  seeking  to  interface  the  deductive  system  with 
a more  sophisticated  data  management  system  for  retrieval  over  much  larger  files  of 
specific  facts.  We  want  this  interface  to  be  efficient  and  infrequent  and  to  be 
used  only  when  facts  are  needed  in  a proof.  The  last  example  presented  in  this 
paper  involves  such  an  interface. 

3.  Deductive  Interactions 

Before  describing  middle- term  chains  and  proof  planning,  we  will  first  show  some 
simple  examples  of  the  use  of  rules  in  deduction.  Figure  la  shows  that  if  we  know 
or  assume  some  proposition  A and  if  we  have  a rule  that  states  that  A implies  B, 
then  we  can  infer  B.  Figure  lb  shows  that  we  can  deduce  that  Joe  is  a human,  given 
that  he  is  a man  and  that  all  men  are  human.  This  deduction  requires  the  substitu- 
tion of  "Joe"  for  the  variable  x.  This  substitution  is  found  by  a pattern  matcher 
which  operates  on  the  argument  strings  of  the  relation  being  considered.  We  will 
refer  to  a successful  pattern  match  as  a "unification"  (after  Robinson  [21]). 
Unifications  represent  deductive  interactions  and  are  shown  as  vertical  lines  in 
Figure  1 and  in  subsequent  figures. 

In  Figure  1c,  two  rules  are  needed  to  deduce  that  Joe  is  a mammal.  The  two 
rules  deductively  interact  through  a unification  between  the  two  occurrences  of  the 
predicate  HUMAN.  Note  the  consistency  of  the  variable  substitutions  in  the  three 
unifications:  The  variable  x must  equal  Joe;  the  variable  y must  equal  x;  and  y 
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must  equal  Joe.  We  will  later  see  how  these  variable  substitutions  are  combined 
and  used  In  proof-plan  verification.  Note  also  the  structural  pattern  In  Figure 
1c»  a unification  followed  by  an  Implication,  followed  by  a unification,  etc., 
forming  a deductive  Implication  chain  between  MAN(Joe)  and  MAMMAL(Joe)  through  the 
two  given  rules. 

4.  Middle-Term  Chains 

Consider  the  example  In  Figure  2.  For  purposes  of  simplicity,  only  predicate 
names  are  shown  In  the  query  and  In  the  rules.  The  argument  strings  of  the 
predicates  have  been  suppressed.  The  Input  query  Is  broken  down  Into  the  assumption 
A and  the  goal  D.  Suppose  the  three  given  rules  are  known  to  the  system.  The  sub- 
script attached  to  each  predicate  serves  to  Identify  the  rule  number  In  which  the 
predicate  occurs.  Suppose  that  unifications  exist  between  the  two  occurrences  of 
B and  between  the  two  occurrences  of  C.  Suppose,  further,  that  unifications  exist 
between  the  assumption  A and  the  occurrence  of  A in  rule  1,  l.e. , Aj,  and  between 
the  goal  D and  03>  Then  we  have  the  middle-term  chain  shown  In  Figure  2.  A^, 
which  unifies  with  the  assumption.  Implies  Bj,  which.  In  turn,  unifies  with  6^, 
which  Implies  Cg.  etc.  The  relations  B and  C are  middle-term  predicates,  i.e., 
relations  needed  to  deductively  link  the  assumption  to  the  goal.  The  predicate 
occurrences  B^,  Bg.  C2,  and  are  middle-term  predicate  occurrences.  A^  and  Dj, 
the  chain  end  points,  are  occurrences  of  the  assumption  and  the  goal,  respectively. 

The  proof  plan  Involving  this  middle-term  chain  Is  also  shown  In  Figure  2.  The 
plan  becomes  a completed  proof  If  the  variable  substitutions  Involved  In  the 
unifications  are  consistent,  l.e.,  no  variable  takes  on  two  different  constants  as 
Its  value.  Plan  verification  will  be  shown  in  later  examples. 


5.  Predicate  Connection  Graph 

In  generating  middle-term  chains,  the  system  uses  a net  structure  called  the 


Query:  A^D 

Assumption:  A 


Goal:  D 

Rules: 

(1)  A^B, 

(2)  B2  D C2 

(3)  C3  3 D3 


iviiddle-Term  Chain: 
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predicate  connection  graph.  This  graph  Is  an  abstraction  of  the  rules  known  to 
the  system.  It  contains  Information  about  the  deductive  interactions  among  the 
rules  and  the  Implications  within  the  rules.  As  rules  are  entered  Into  the 
system,  unifications  between  them  and  the  other  rules  are  pre-computed  and 
stored.  The  existence  of  unifications  is  stored  in  a unifications  array  while  the 
substitution  lists  associated  with  the  unifications  are  stored  in  a variable- 
substitutions  array.  The  chain-generation  and  proof-planning  processes  use  the 
Information  about  the  existence  of  unifications  but  need  not  be  concerned  with  the 
substitution  lists  required  by  the  unifications.  Once  a global  plan  has  been 
developed,  the  plan  verifier  will  combine  the  substitution  lists  of  the  unifica- 
tions in  the  plan  and  check  for  substitution  consistency. 

The  predicate  connection  graph  bears  some  resemblance  to  the  graphs  used  in 
the  proof  procedures  of  several  theorem- proving  systems  [13,25,26].  The  similarity 
lies  mainly  in  representation,  i.e.,  using  a graph  to  represent  deductive  inter- 
actions. The  primary  difference  is  in  the  way  the  graph  is  used.  Here  the  graph 
is  used  as  a planning  tool  to  generate  possible  deductive  paths  through  the  rules. 

The  emphasis  is  on  using  the  graph  to  locate  potentially  relevant  rules  from  a 
large  set.  The  graph  does  not,  by  itself,  generate  proofs,  as  it  does  in  these 
other  systems. 

For  the  simple  example  in  Figure  2,  we  note  that  if  the  assumption  A and 
the  goal  0 were  removed  from  the  proof  plan  shown,  the  resulting  structure 
would  essentially  be  identical  to  the  predicate  connection  graph  that  would 
exist  for  the  three  given  rules.  The  chain-generation  process  finds  deduc- 
tive paths  through  this  graph.  For  larger  rule  sets,  a proof  plan  would 
typically  be  a small  subset  of  the  deductive  Interactions  contained  in  the 
predicate  connection  graph. 

It  is  Important  to  summarize  the  use  of  the  pattern  matcher  and  the  use  of  the 
substitution  lists  that  result.  First,  the  pattern  matcher  Is  invoked  when  a 
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new  rule  Is  entered  Into  the  system.  Unifications  of  the  new  rule  with  existing 
rules  are  determined  and  stored.  Second,  for  a given  Input  query,  the  pattern 
matcher  determines  possible  chain  Initialization  points,  l.e..  It  finds  predicate 
occurrences  within  the  rules  which  unify  with  query  assumptions  and  goals.  And 
third,  after  proof  planning,  the  substitution  lists  of  the  unifications  In  the 
plan  are  combined  and  verified  During  proof  planning  the  existence  of  unifi- 
cations, as  determined  by  the  pattern  matcher.  Is  used,  but  the  pattern  matcher 
Itself  Is  not  Invoked. 

Thus  most  of  the  work  done  by  the  pattern  matcher  occurs  In  a pre-processor 
for  entering  rules  Into  the  system  and  in  proof -plan  verification,  which  fills  out 
the  details  of  the  proof.  This  Is  In  contrast  to  most  resolution-based  systems, 
production  systems,  and  PLANNER-1  Ike  systems  which  continually  verify  substitutions 
In  the  course  of  finding  a proof  or  answering  a given  query.  Although  the  idea  of 
suppressing  details  is  not  new  (e.g.,  Newell,  Shaw,  and  Simon  [16]),  very  few  sys- 
tems use  different  stages  of  processing  to  operate  on  varying  levels  of  detal1. 

A notable  exception  Is  the  work  of  Sacerdotl  [22,23],  whose  problem-solving  systems 
show  considerable  promise  in  the  use  of  global  planning  and  the  suppression  of 
details  for  later  processing. 

6.  Forward  and  Backward  Chaining 

The  middle-term  chaining  process  can  be  visualized  as  one  which  generates 
expanding  wave  fronts  forward  from  assumptions  and  backward  from  goals.  The 
query  assumptions  are  unified  with  occurrences  of  the  same  predicates  in  the 
rules.  The  successfully  unified  occurrences  represent  possible  chain  starting 
points.  Similarly,  those  occurrences  unifying  with  query  goals  represent 
possible  chain  end  points.  From  then  on  the  system  uses  the  predicate  connec- 
tion graph  exclusively  to  find  implication  links  and  unifications  In  each 
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dlrection.  In  the  forward  direction.  Implication  links  from  the  unified 
assumption  occurrences  are  extracted  from  the  predicate  connection  graph. 

Then  occurrences  unifying  with  the  implied  occurrences  are  extracted,  etc. 

The  same  process  occurs  backward  from  the  goal  occurrences.  When  an  Inter- 
section occurs  between  the  two  wave  fronts,  the  system  has  found  a middle- 
term  chain.  (See  Nilsson  [17]  for  a general  discussion  of  graph  searching 
techniques  and  Pohl  [18]  for  strategies  *n  bi-directional  graph  searching 
and  Intersecting.) 

Let  us  look  at  an  example  of  how  the  chain  in  Figure  2 was  found  for  the  given 
query  from  a larger  set  of  rules.  Suppose  that  in  addition  to  the  three  rules 
shown  In  Figure  2,  the  following  rules  also  existed  in  the  system:  (4)  d E^, 

(5)  Eg  3 Fg,  (6)  Gg  3 Hg,  and  (7)  H7  3 D^.  Suppose  also  that  unifications 
exist  between  E^  and  Eg,  between  Hg  and  H^,  between  and  the  query  assumption 
A,  and  between  07  and  the  query  goal  0.  The  predicate  connection  graph  for  the  seven 
rules  is  shown  in  Figure  3.  Possible  chain  starting  points  are  A^  and  A^.  Possible 
chain  end  points  are  03  and  D^.  Forward  chaining  from  A^  and  A^  yields  B-|  and  E^. 
Unifications  from  and  E^  areB^  and  Eg.  Implications  from  B2  and  Eg  areC^ 
and  Fg.  Backward  chaining  from  03  and  07  yields  C3  and  H7<  Unifications  from  C3 
and  H7  areC2  and  Hg.  There  is  now  an  Intersection  point  between  the  two  wave 
fronts  at  C2.  The  resulting  chain  is  A1 -Bj ■B2"C2"C3'D3*  In  actual  operation, 
forward  chaining  and  backward  chaining  proceed  alternately  one  step  at  a time 
until  a chain  is  found  (or  until  effort  limits  have  been  exceeded).  The  chain 
found  is  the  only  one  that  exists  for  this  example  set  of  seven  rules.  Forward 
chaining  to  Fg  is  a dead  end  as  is  backward  chaining  to  Hg  (and  further  back  to 


» , 


66)- 

Note  that  the  chaining  processes  found  in  most  other  systems  will  verify  as 
they  proceed,  l.e.,  substitutions  for  variables  in  the  argument  strings  will  be 
found  and  combined  as  the  system  proceeds  from  A1  to  B1  to  B2,  etc.  Similarly 
they  would  also  verify  as  they  proceed  from  A4  to  E^  to  Eg  to  Fg.  Our  system 
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Rules: 


(1) 

A1 

= B1 

(4) 

A4  = E4 

(2) 

eg 

m 

U 

o 

M 

(5) 

E5  = F5 

<3) 

C3 

=»  °3 

(6) 

= H6 

(7) 

H?  =>  D? 

Predicate  Connection  Graph: 


a4  =>  1 

■4 

G6  =>  H 

b2  = 92 

1 

% = F5 

H 

c3  = d3 


D7 


II 


Middle-Term  Chaining: 

Forward  Backward 


point 


Figure  3.  Middle-term  chaining  through  a larger  rule  set 
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verifies  only  after  a proof  plan  (In  this  case  the  single  middle-term  chain) 
has  been  found.  The  verification  of  dead  ends  In  this  example  Is  avoided. 

The  process  of  chain  generation  Is  quite  fast,  owing  largely  to  the  representa- 
tion and  storage  of  Information  within  the  predicate  connection  graph.  Each 
predicate  occurrence  In  the  rules  Is  assigned  a unique  Integer,  e.g.,  Is 
assigned  1,  Is  2,  B2  Is  3,  etc.  The  set  of  unifications  out  of  a particular 
occurrence  Is  given  simply  by  a list  of  Integers,  those  representing  the  unifying 
occurrences.  The  unifications  are  stored  In  an  array  Indexed  on  the  occurrence 
Integers.  Thus,  to  find  the  unifications  existing  out  of  the  predicate  occurrence 
whose  assigned  integer  Is  2,  i.e.,  Bj , the  system  simply  accesses  the  second 
element  In  the  unifications  array.  A similar  process  occurs  In  the  storage  and 
retrieval  of  Implication  links  occurring  within  rules.  These  Implications  are 
stored  In  another  special-purpose  array.  Thus,  once  chain  Initialization  points 
are  found,  the  system  executes  its  forward  and  backward  chaining  by  Indexing  Into 
the  two  arrays. 

The  segmentation  of  Information  into  different  arrays  has  greatly  facilitated 
the  organization  and  processing  efficiency  of  the  system.  The  plan  verification 
processor,  for  example,  uses  only  the  variable-substitutions  array  to  retrieve  the 
substitution  lists  of  the  unifications  involved  in  a proof  plan.  Such  segmentation 
will  be  of  great  importance  when  main  memory  cannot  accommodate  a very  large  set 
of  rules.  The  various  arrays  can  be  swapped  in  and  out  of  main  memory  along  with 
the  associated  processor.  In  the  current  version  of  the  deductive  system,  seven 
arrays  are  used.  Implementation  details  are  given  In  Klahr  [12], 

Even  with  such  efficient  storage  and  retrieval  of  deductive  and  Implication 
Information,  the  expanding  wave  fronts  can  become  quite  large,  especially  If  there 
are  a large  number  of  deductive  Interactions  among  the  rules.  Three  primary  methods 
are  used  In  reducing  and  ordering  the  Implications  and  unifications  picked  up  at 
any  point  In  mlddlt-term  chaining.  These  Include  the  use  of  semantic  Information, 
user-supplied  advice,  and  plausibility  measures. 
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The  pattern  matcher  finds  unifications  between  predicate  occurrences  by 
finding  substitution  lists  that  make  the  argument  strings  of  the  occurrences 
Identical.  While  matching  two  argument  strings,  the  pattern  matcher  also  checks 
the  domain  classes  of  the  variables  and  constants  being  matched.  When  rules  or 
queries  are  entered  Into  the  system,  the  user  can  specify  domain  classes  for 
any  variables  or  constants  Involved.  For  example,  he  could  specify  the  variable 
x as  being  a scientist,  the  variable  y as  being  a politician,  and  the  constant 
"Einstein"  as  being  a scientist.  Then  the  pattern  matcher  would  allow  the 
substitution  of  Einstein  for  x,  but  would  not  allow  the  substitution  of  Einstein 
for  y nor  the  substitution  of  x for  y,  since  they  belong  to  different  domains. 
Domain-class  specifications  can  also  include  conjunctions,  e.g.,  scientist  and 
teacher;  disjunctions,  e.g.,  scientist  or  politician;  and  negations,  e.g.,  not 
scientist.  The  use  of  such  semantic  domain  types  can  greatly  reduce  the  number 
of  deductive  Interactions  that  exist  among  the  rules  as  well  as  reduce  the 
number  of  chain- Initialization  points  for  a given  query. 

We  are  currently  expanding  the  use  of  domain  types  to  Include  domain  subsets 
and  supersets.  This  would  allow  successful  matches  to  occur  between  elements 
belonging  to  domains  In  which  one  domain  is  a subset  of  the  other,  e.g.,  between 
a variable  that  is  a scientist  and  one  that  Is  a human. 

The  system  also  allows  the  user  to  supply  advice  to  the  system.  Advice  can  be  ^ 
entered  for  a particular  query  or  It  can  be  stored  In  a more  permanent  advice  file 
which  the  system  accesses  for  each  goal.  The  user  can  specify  that  particular 
predicates  and  rules  be  used  In  deductive  chaining.  The  advice  Is  transformed 
Into  predicate  and  rule  alert  lists  (as  well  as  negative  alert  lists  for  negative 
advice),  which  serve  to  order  Implications  and  unifications  within  middle-term 
chaining.  In  the  case  of  advised  predicates,  the  system  attempts  to  find  chains 
through  occurrences  of  those  predicates.  For  advised  rules,  the  system  tries  to 
chain  through  them  whenever  possible.  Unifications  are  ordered  at  each  point  In 
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the  chaining  process  so  that  unifications  to  advised  rules  are  given  priority. 
Ordering  rules  in  this  way  is  similar  to  the  use  of  meta-rules  to  order  rule- 


selection  in  [61,  although  here  it  occurs  in  proof  planning. 

The  use  of  advice  gives  a user  much  flexibility.  If  he  can  aid  the  system 
by  specifying  particular  rules  or  predicates  as  being  important,  the  system  can 
take  advantage  of  this.  In  addition,  he  can  try  to  find  alternative  proofs 
using  different  advice  to  see  if  certain  rules  are  appropriate  or  to  get  alter- 
native derivations.  In  question-answering  systems  it  is  often  useful  to  find 
alternative  proofs  to  give  more  credence  to  the  conclusion,  particularly  If 
rules  have  varying  degrees  of  plausibility.  The  system  does  not  necessarily 
stop  with  the  first  completed  proof.  At  the  user's  request,  it  will  continue 
to  find  alternative  derivations  with  or  without  advice. 

The  final  method  used  in  ordering  and  selecting  elements  in  middle-term 
chaining  is  through  the  use  of  plausibility  measures.  These  measures  are 
similar  to  the  certainty  factors  used  in  MYCIN  [6].  The  plausibility  of  a rule 
represents  the  degree  of  certainty  that  the  antecedents  of  the  rule  imply  the 
consequents.  Plausibilities  can  be  reset  by  the  user  at  any  time.  Unifications 
are  ordered  during  chaining  so  that  those  associated  with  highly  plausible  rules 
are  picked  up  first,  although  advised  rules  are  given  highest  priority  regardless 
of  plausibility. 

7.  Generating  and  Verifying  Proof  Plans 

For  each  middle-term  chain  generated,  the  system  extracts  the  rules  containing 
the  chain  occurrences  and  determines  If  any  subproblems  result  from  the  rules.  In 
Figure  4,  the  middle-term  chain  shown  Is  the  same  as  the  one  generated  in  Figure 
2.  (Once  again,  argument  strings  have  been  suppressed.)  However,  the  rules 
are  more  complex.  In  the  first  rule,  both  Ej  and  Aj  are  needed  as  conditions  for 
Inferring  Bj.  Thus  Ej  Is  set  up  as  a subproblem.  In  the  second  rule,  Fj  Is  not 


B2  3 &(c2,  F2) 


v(G3.  c3)  d v{D3,  H3) 

0 

Figure  4.  Chaining  through  more  complex  rules. 


a subproblem  since  B2  Infers  C2  independently  of  F2.  Similarly,  6j  Is  not  a 
subproblem  In  the  third  rule.  H^,  however.  Is  a subproblem.  To  Infer  by 
this  proof,  we  must  show  the  negation  of  to  be  true. 


Each  subproblem  Is  set  up  for  either  deductive  support  through  the  rules, 
for  data-base  support  from  the  file  of  specific  facts,  or  for  computation.  The 
latter  occurs  for  predicates  which  are  defined  by  user-supplied  procedures.  When 
such  a predicate  occurs  as  a subproblem,  Its  corresponding  procedure  Is  executed 
to  determine  Its  truth  value  (e.g.,  predicates  such  as  GREATER-THAN).  The  user 
can  also  specify  that  certain  predicates  be  given  data-base  support.  These  would 
typically  be  predicates  that  are  completely  defined  by  the  set  of  facts  to  which 
they  apply,  or  predicates  about  which  the  user  can  supply  the  missing  Information. 

In  the  latter  case  the  system  would  output  conditional  answers,  i.e.,  answers 
which  can  be  concluded  if  the  user  can  supply  the  remaining  information.  All 
other  predicates  occurring  as  subproblems  are  set  up  for  deductive  support  to 
be  obtained  by  recursive  calls  on  the  deductive  system. 

Let  us  look  at  more  concrete  examples.  Consider  the  example  shown  In  Figure 
5.  The  system  needs  to  find  a value  for  y,  the  place  where  Socrates  lives,  given 
the  two  assumptions.  Suppose  the  system  found  the  middle-term  chain  given  by  the 
occurrences  involved  In  the  unifications  u^,  u^and  u3>  The  two  rules  shown  state 
that  if  someone  is  the  husband  of  another  then  they  are  married;  and  married 
people  live  in  the  same  place.  The  resulting  subproblem  LIVES-INtx^.Xg)  Is  resolved 
directly  from  an  assumption  as  shown  by  unification  u*.  The  resulting  structure 
represents  a plan  and  Is  not  yet  a proof,  since  we  have  not  analyzed  the  variable 
substitutions  In  the  plan.  This  Is  done  by  the  plan  verifier  when  there  are  no 
remaining  deductive  subproblems  (although  data-base  and  compute  subproblems  may 
still  exist). 

The  plan  verifier  extracts  the  substitutions  of  the  unifications  In  the  plan 
and  creates  variable-flow  classes.  Each  class  specifies  a list  of  elements,  I.e., 
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Query:  &(HUSBAND(Socrates,Xanthippe).  LIVESIN(Xanthippe.Athens)) 

3 LI  VES-IN  (Socrates, y) 


Assumptions:  HUSBAND(Socrates,Xanthippe),  LIVES-IN(Xanthippe.Athens) 
Goal:  LIVES-IN(Socrates.y) 

Proof  Plan: 

HUSBAND(Socrates, Xanthippe) 

LIVES-IN(Xanthippe.Athens) 
HUSBAND(xvx2)  3 MARRIEDlx^)^"^ 


&(MARRIED(x3,x4),  UVES-IN(x4.x5))  3 LIVES  IN(x3,x5) 

/«  3 

LI V ES- 1 N ( Socra  tes.y ) 

Variable-Flow  Classes:  (Socrates.  x1#  x3)  (Xanthippe.  x2.  x4) 


(Athens.  x5.  y) 


Data-Base  Requests:  None 


Answer:  y » Athens 


Figure  5.  Deduction  showing  proof  plan  and 

variable-flow  (verification)  classes. 
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variables  and  constants,  that  must  be  equal  for  the  proof  to  be  logically  valid. 

In  the  example,  must  equal  Socrates  from  unification  Uj.  From  u2,  It  must 
also  equal  x3,  which.  In  turn,  must  equal  Socrates  from  u3.  The  variable-flow 
class  (Socrates,  x^,  x3)  Is  thus  formed.  Following  another  variable  flow,  we  see 
that  x2  equals  Xanthippe  from  Uj , and  equals  x4  from  u2<  The  variable  x4.  In  turn, 
equals  Xanthippe  from  u4>  The  resulting  variable-flow  class  Is  (Xanthippe, 
x2,  x4).  Finally,  If  we  follow  the  flow  of  y,  we  see  that  from  u3,  y equals 
x5  which.  In  turn,  equals  Athens  from  u4,  forming  the  class  (Athens,  x&,  y). 

Note  that  In  each  class  there  Is  at  most  one  distinct  constant.  If  any  variable- 
flow  class  contains  two  different  constants,  the  proof  plan  falls  to  verify.  In 
this  example,  there  are  no  substitution  inconsistencies,  and  the  plan  successfully 
verifies. 

Since  the  proof  plan  contains  no  remaining  subproblems  needing  evaluation  or 
data-base  search,  the  plan  is  a complete  proof.  The  system  then  outputs  an  answer. 

If  variables  were  specified  in  the  input  query,  values  for  these  variables  would 
be  displayed.  If  no  variables  exist,  the  system  would  respond  affirmatively  if 
it  found  a successful  derivation.  In  the  example,  the  variable  y occurs  in  the  query. 
The  system  locates  the  variable-flow  class  in  which  y occurs.  Since  the  constant 
"Athens"  Is  In  the  same  class,  It  becomes  the  value  of  y and  the  answer  to  the 
query. 

Consider  the  example  in  Figure  6.  The  query  asks  whether  MIT  knows  about 
Newlisp  which  was  originated  by  Smith.  The  three  rules  used  are  concerned  with 
knowledge  transfer  among  scientists  and  laboratories:  If  the  originator  of  a 
result  has  scientific  (professional)  contact  with  another  scientist,  the  latter 
will  know  of  the  result.  Scientific  contact  between  scientists  can  exist  if 
they  attend  the  same  conference.  If  a scientist  knows  a certain  result  and  con- 
ducts research  at  a particular  laboratory,  then  the  laboratory  also  knows  the 
result. 
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Query:  ORIGINATES(Smith,  Newlisp)  =>  KNOWS  (MIT.  Newlisp) 
Assumption:  ORIGINATESiSmith.  Newlisp) 

Goal:  KNOWS(MIT,  Newlisp) 


Proof  Plan: 


data-base  ~ 
ORIGIN  ATES(Smith,  Newlisp) 

l°1 


&(ATTEND(xy,xp).  ATTEND(xo.xg))  =)  SCIENTIFIC  CONTACKx,  x„) 
data  base  data  base  ' 0 


&(ORIGINATES(xvx2).  SCIENT|FIC-CONTACT(xj,x3))  D KNOWS(x3.x2) 

&(CONDUCTS  RESEARCH  AT(x4.Xfi).  KNOWSU4.x6))  =>  KNOWS(x5.x6) 
data-base  / 


jit  3 


KNOWS(MIT, Newlisp) 


Variable- Flow  Classes:  (Smith,  xj,  x7)  (Newlisp.  x2,  x6) 

(MIT. x5)  (x3, x4, x8)  (xg) 

Data-Base  Requests:  ATTEND  (Smith.  xg) 

ATTEND  (x3.  x9) 

CONDUCTS-RESEARCH-AT  (x3.  MIT) 


Figure  6.  Deduction  requiring  data-base  facts. 
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The  middle-term  chain  generated  Involves  the  unifications  , u2,  and  u3. 

Two  subproblems  are  formed.  The  occurrence  of  CONDUCTS-RESEARCH-AT  Is  to  be 
given  data-base  support.  This  was  specified  previously  by  the  user  either 
because  the  data-base  has  considerable  knowledge  about  the  relation  or  because 
it  can  be  easily  determined  or  found  by  the  user  (e.g.,  knowledge  about  where 
researchers  work). 

The  other  subproblem  Involves  SCIENTIFIC-CONTACT  which  Is  to  be  given 
deductive  support.  The  system  w*s  unable  to  find  a middle-term  chain  for  this 
subgoal,  so  It  chained  back  to  the  rule  shown  via  unification  u4>  (Backward 
chaining  will  occur  whenever  middle-term  chains  cannot  be  found  between  assunp- 
tlons  and  goals  or  when  no  assumptions  exist.  In  such  cases,  the  middle-term 
chain  generator  defaults  to  using  only  backward  chaining.  Techniques  are  being 
developed  whereby  the  system  can  generate  appropriate  assumptions  for  goals  when 
no  assumptions  exist  so  that  middle-term  chaining  can  be  attempted.  One  such 
technique  involves  finding  facts  whose  predicates  apply  to  constants  which  occur 
In  the  goal,  and  using  these  predicates  as  assumptions  [12].)  Two  more  subproblems 
result,  both  requiring  data-base  support.  The  resulting  proof  plan  has  three 
subgoals  to  be  resolved  by  data-base  search. 

The  plan  verifier  forms  the  variable-flow  classes.  No  conflicts  occur,  and 
the  data-base  subproblems  are  set  up  with  respect  to  the  classes.  The  subproblem 
ATTEND(xy,xg)  becomes  ATTEND (Smith, xg),  since  the  constant  "Smith"  is  in  the  same 
class  as  Xy.  ATTEND(xg,xg)  becomes  ATTEND(x3,xg) . In  this  case,  the  variable-flow 
class  containing  xQ  does  not  contain  a constant.  The  variable  Is  then  replaced 
by  the  first  member  In  Its  class.  This  is  done  so  that  all  variables  within  the 
same  class  will  be  specified  as  being  identical  for  data-base  searching.  Thus  x3 
Is  also  substituted  for  x4  In  C0NDUCTS-RESEARCH-AT(x4,x5),  which  becomes  CONDUCTS- 
RESEARCH-AT(x3>MIT),  MIT  being  substituted  for  Xg.  Thus,  the  deductive  system 
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can  conclude  that  MIT  knows  about  Newlisp  If  there  is  a scientist  who  works  at 
MIT  and  who  also  attended  a conference  which  Smith  attended. 

The  data-base  search  requests  are  then  sent  to  a data  management  system  (cur- 
rently a relational  data  management  system  written  In  l !SP  by  the  author  [12]), 
which  attempts  to  find  values  for  the  open  variables  which  would  satisfy  the 
subproblems.  For  example,  if  it  found  that  Smith  attended  ACM76,  which  Jones 
also  attended,  and  that  Jones  works  at  MIT,  then  the  system  would  respond  "yes" 
to  the  original  query.  If  insufficient  information  about  the  subgoal  relations 
existed  in  the  data  base,  the  system  would  output  a conditional  answer.  In  a 
sense,  the  deductive  system  can  be  used  without  any  data  base.  All  data-base 
search  requests  would  be  simply  given  to  the  user  as  renaining  subproblems  for 
him  to  resolve. 

We  can  also  use  the  example  in  Figure  6 to  show  how  variable  and  constant 
domain-class  specifications  can  be  used  to  reduce  search  space.  For  example,  a 
user  could  specify  that  x^ , x^  and  x4  are  scientists.  These  variables  occur  in 
predicates  concerned  with  scientists  origination -results,  knowing  results,  working 
at  laboratories,  etc.  Similarly,  x^  coilld  be'sp^ified  as  being  a laboratory. 
Indicating  that  KN0WS(x5,xfi)  refers  to  laboratories  knowing  results.  In  the  query, 
MIT  could  be  typed  as  a laboratory.  The  pattern  matcher  will  check  domain-class 
compatibility  when  finding  deductive  Interactions.  Thus  MIT  would  match  x&,  since 
they  are  of  the  same  domain  type,  but  would  not  match  Xy  KNOWS ( MI T.Newl isp)  will 
thus  unify  with  KNOWS(xg,Xg)  but  will  not  unify  with  KNOWSfx^.Xg),  thereby  reducing 
the  number  of  potential  unifications. 

8.  Summary  and  Results 

The  main  focus  of  the  research  described  here  has  been  on  developing  techniques 
to  find  relevant  rules  in  deductive  question  answ»  : ing.  However,  we  feel  that 
these  techniques  can  be  used  in  most  rule-based  stems,  including  those  using 
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production  rules  and  those  using  rules  in  the  form  of  predicate  calculus  implica- 
tions. The  processes  of  forward  chaining  and  backward  chaining  have  been  used 
in  previous  rule-based  systems,  but  here  they  are  combined  to  find  middle-term 
chains  which  form  the  basis  of  a planning  process  designed  primarily  for  rule 
selection.  Middle-term  chains  are  found  through  the  use  of  a predicate  connection 
graph  which  contains  Information  about  the  existence  of  deductive  interactions 
between  rules  and  the  existence  of  implications  within  rules.  Middle-term  chains 
are  expanded  into  proof  plans  which  are  verified  when  there  are  no  remaining  sub- 
goals that  require  deductive  support.  Remaining  data-base  subgoals  are  sent  to  a 
data  management  system  for  retrieval  from  the  file  of  specific  facts.  Answer 
extraction  is  straightforward,  given  the  variable-flow  classes  produced  during 
plan  verification  and  updated  after  data-base  search  (returned  data-base  values 
for  variables  are  added  to  the  variable-flow  classes). 

The  primary  features  of  the  deductive  system  may  be  summarized  as  follows: 

1.  Separation  of  rules  and  facts;  deductive  system  operates  on  the 
rules  while  a data  management  system  retrieves  needed  facts. 

2.  Pre-computation  of  deductive  unifications  between  new  rules  and 
existing  rules  when  rules  are  added  to  the  system;  efficient 
storage  and  retrieval  of  deductive  information  within  a predicate 
connection  graph. 

3.  Process  of  middle-term  chaining  to  find  deductive  paths  through 

the  rules.  When  there  are  no  assumptions  or  when  there  are  no  goals, 
middle-term  chaining  defaults  to  backward  chaining  or  forward 
chaining,  respectively. 

4.  Separate  processing  phases  for  rule  selection  (planning  using  middle- 
term  chains)  and  verification. 

5.  Use  of  semantic  domain  classes  to  reduce  search  space. 


6.  Use  of  user  guidance  about  which  rules  or  predicates  may  be  appropriate. 
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The  deductive  planning  system  is  implt;  nted  in  LISP  and  is  operational  on 
an  IBM  370/158.  Our  Initial  experimentation  involves  the  use  of  about  thirty 
rules  containing  approximately  forty  deductive  interactions.  Results  so  far  have 
been  promising.  Queries  requiring  the  use  of  six  or  seven  rules  average  about 
one  second  of  processing  time  which  includes  a call  on  the  data  management  system 
to  resolve  about  four  data-base  subgoals.  A more  comprehensive  set  of  rules  is 
being  developed  for  more  extensive  testing.  One  of  the  problems  in  generating  a 
large  number  of  rules  is  that  of  knowledge  acquisition  and  finding  experts  in 
particular  knowledge  domains  to  aid  in  the  generation  of  appropriate  and  meaningful 
rules. 

We  are  also  concerned  with  computational  comparisons  with  other  deductive 
systems.  But  such  comparisons  are  difficult  to  generate  and  obtain.  Our  system 
involves  the  generation  and  then  validation  of  proof  plans.  This  approach  is  in 
sharp  contrast  to  most  other  deductive  systems.  There  is  a difference  in  emphasis. 
Our  system  attempts  to  find  relevant  rules  before  those  rules  are  actually  used 
(verified).  Thus  it  would  be  difficult  to  compare  our  system  with,  for  example, 
resolution  techniques  which  have  comparison  measures  such  as  number  of  clauses 
generated,  proof  level,  etc.,  for  an  example  set  of  queries.  For  a small  num- 
ber of  rules,  other  deductive  approaches  may  be  more  efficient,  since  rule 
selection  may  be  less  important.  But  as  the  number  of  rules  Increases,  the  prob- 
lem of  rule  selection  becomes  more  significant,  and  we  feel  certain  that  some 
form  of  planning  must  be  done.  We  are  encouraged  by  our  progress  and  results, 
and  feel  that  the  planning  techniques  that  we  have  developed  hold  considerable 
promise  for  rule-based  deduction. 
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